Applying a Hybrid Query Translation Method to Japanese/English Cross-Language Patent Retrieval

نویسندگان

  • Masatoshi Fukui
  • Shigeto Higuchi
  • Youichi Nakatani
  • Masao Tanaka
  • Atsushi Fujii
  • Tetsuya Ishikawa
چکیده

This paper applies an existing query translation method to cross-language patent retrieval. In our method, multiple dictionaries are used to derive all possible translations for an input query, and collocational statistics are used to resolve translation ambiguity. We used Japanese/English parallel patent abstracts to perform comparative experiments, where our method outperformed a simple dictionary-based query translation method, and achieved 76% of monolingual retrieval in terms of average precision.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

結合線上翻譯服務的跨語言專利檢索系統 (Integrating Query Translation and Text Classification in a Cross-Language Patent Retrieval System) [In Chinese]

In this paper, we introduced a cross-language patent retrieval system which combined the various free web translators on the internet. The bi-gram indexing method was used to deal with the multilingual patent documents, and the query translation method was used to translate the query form the source language to the target language. Currently, this system provides the functions of the monolingua...

متن کامل

Query Translation from Indonesian to Japanese Using English as Pivot Language

In this paper, we propose a query translation method for Cross Lingual Information Retrieval (CLIR) system which works for Japanese (target language) documents with Indonesian (source language) queries. Because Indonesian-Japanese is an unfamiliar language pair, it is difficult to translate Indonesian queries to Japanese directly. Therefore, we use English as a pivot language for transitive tra...

متن کامل

A System for Japanese/English/Korean Multilingual Patent Retrieval

In response to growing needs for cross-lingual patent retrieval, we propose PRIME (Patent Retrieval In Multilingual Environment system), in which users can retrieve and browse patents in foreign languages only by their native language. PRIME translates a query in the user language into the target language, retrieves patents relevant to the query, and translates retrieved patents into the user l...

متن کامل

Applying Machine Translation to Two-Stage Cross-Language Information Retrieval

Cross-language information retrieval (CLIR), where queries and documents are in di erent languages, needs a translation of queries and/or documents, so as to standardize both of them into a common representation. For this purpose, the use of machine translation is an e ective approach. However, computational cost is prohibitive in translating large-scale document collections. To resolve this pr...

متن کامل

Exploring the use of Machine Translation resources for English-Japanese Cross-Language Information Retrieval

This paper describes a detailed investigation into the use of Machine Translation techniques for query translation in English-Japanese Cross-Language Information Retrieval (CLIR). Experimental results are reported for the standard BMIR-J2 Japanese text retrieval collection extended to a CLIR task. Results indicate that retrieval performance increases as greater linguistic processing is used in ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cs.CL/0206034  شماره 

صفحات  -

تاریخ انتشار 2000